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METHODS FOR ISOLATING AND CHARACTERIZING ENDOGENOUS 
mRNA-PROTEIN (mRNP) COMPLEXES 

RELATED APPLICATIONS 

This application claims the benefit of U.S. Provisional Application No. 
60/173,338, filed 28 December 1999, which is hereby incorporated in its entirety. 

5 STATEMENT OF FEDERAL SUPPORT 

This invention was made with government support under grant number ROI 
CA79907 from the National Institutes of Health. The United States government has 
certain rights to this invention. 

10 FIELD OF THE INVENTION 

This invention relates generally to post-transcriptional regulation and methods 
of profiling gene expression. 

BACKGROUND OF THE INVENTION 

15 Many diseases are genetically based, and the genetic background of each 

individual can have a profound effect on his or her susceptibility to disease. The 
relatively new field of functional genomics has provided researchers with the ability to 
determine the functions of proteins based upon knowledge of the genes that encode 
the proteins. A major goal of functional genomics is to identify gene products that 

20 are suitable targets for drug discovery. Such knowledge can lead to a basis for 
target validation if it is demonstrated that the target of interest has an essential 
function in a disease. Accordingly, a need exists to develop methods that allow 
profiling of the gene expression state of cells and tissues in order to understand the 
consequences of genetics on growth and development. 

25 Understanding global gene expression at the level of the whole cell requires 

detailed knowledge of the contributions of transcription, pre-mRNA processing, 
mRNA turnover and translation. Although the sum total of these regulatory 
processes in each cell accounts for its unique expression profile, few methods are 
available to independently assess each process en masse. 
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The expression state of genes in a complex tissue or tumor is generally 
determined by extracting messenger RNAs from samples (e.g., whole tissues) and 
analyzing the expressed genes using cDNA libraries, microarrays or serial analysis 
of gene expression (SAGE) methodologies. See, e. g., Duggan, ef a/., (1999) 

5 Nature Genetics 21 , 10-14.; Gerhold, ef a/. f (1 999) Trends in Biochemical Sciences 
24, 168-173; Brown, ef a/., (1999) Nature Genetics 21, 38-41; Velculescu, ef a/., 
(1995) Science 270, 484-487 Velculescu, ef a/. (1997) Cell 88, 243-251. In order to 
determine the gene expression profile of any single cell type within a tissue or tumor 
or to recover those messenger RNAs, the tissue must first be subjected to 

10 microdissection. This is very laborious, as only a small amount of cellular material is 
recovered and the purity as well as the quality of the cellular material is 
compromised. 

Post-transcriptional events influence the outcome of protein expression as 
significantly as transcriptional events. The regulation of transcription and post- 

15 transcription are generally linked. Altering the expression of transcriptional activators 
or repressors has important consequences for the development of a cell. Therefore, 
feedback loops following translational activation of specific mRNAs may change the 
program of transcription in response to growth or differentiation signals. DNA arrays 
are well-suited for profiling the steady-state levels of mRNA globally {i.e., total mRNA 

20 or the "transcriptome"). However, because of post-transcriptional events affecting 
mRNA stability and translation, the expression levels of many cellular proteins do not 
directly correlate with steady-state levels of mRNAs (Gygi et al. (1999) Mol. Cell Biol. 
19, 1720-1730; Futcher ef ai (1999) MoL Cell Biol. 19, 7357-7368). 

Many mRNAs contain sequences that regulate their post-transcriptional 

25 expression and localization (Richter (1996) in Translational Control, eds. J.W.B 

Hershey, ef a/., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, pp. 481- 
504). These regulatory elements reside in both introns and exons of pre-mRNAs, as 
well as in both coding and noncoding regions of mature transcripts (Jacobson and 
Peltz (1996) Annu. Rev. Biochem. 65, 693-739; Wickens et al. (1997) Curr. Opin. 

30 Genef. Dev. 7, 220-232). One example of a sequence-specific regulatory motif is the 
AU-rich instability element (ARE) present in the 3'-untranslated regions (UTRs) of 
early-response gene (ERG) mRNAs, many of which encode proteins essential for 
growth and differentiation (Caput etal. (1986) Proc. Natl. Acad. Sci. USA 83, 1670- 
1674; Shaw and Kamen (1986) Cell 46, 659-667; Schiavi et al. (1992) Biochim. 
35 Biophys. Acta 1114, 95-106; Chen and Shyu (1995) Trends Biochem. Sci. 20, 465- 
470). Regulation via the ARE is poorly understood, but the mammalian ELAV/Hu 
proteins have been shown to bind to ARE sequence elements in vitro and to affect 
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post-transcriptional mRNA stability and translation in vivo (Jain ef a/. (1997) Mol. Cell 
Biol. 17, 954-962; Levy etal. (1998) J. Biol. Chem. 273, 6417-6423; Fan and Steitz 

(1998) EMBO J. 17, 3448-3460; Peng et al. (1998) EMBO J. 17, 3461-3470; Keene 

(1999) Proc. Natl. Acad. Sci. USA 96, 5-7). 

5 In vitro RNA selection methods based upon cellular sequences are reported 

in Gao et a/., Proc. Natl. Acad. Sci USA 90, 11207-11211 (1994) and U.S. Patent 
Nos. 5,773,246, 5,525,495 and 5,444,149, all to Keene ef a/., the disclosures of 
which are incorporated herein in their entirety. Generally, these methods were 
intended to identify large numbers of mRNAs present in messenger RNP (mRNP) 

10 complexes, and utilized in vitro binding and amplification of mRNA sequences from 
large pools of naturally-occurring mRNAs. These studies used proteins (referred to 
as ELAV or Hu proteins) known to bind to AU-rich sequence elements present in the 
untranslated regions of cellular mRNAs. These experiments led to the discovery that 
mRNAs which are structurally or functionally related may be revealed using multi- 

15 targeted RNA binding proteins {i.e., RNA binding proteins that specifically bind more 
than one target). See Levine, ef a/., (1994) ef a/., Molecular and Cellular Biology 1 3, 
3494-3504; and King, etal., (1993) Journal of Neuroscience 14, 1943-1952; 
reviewed in Antic and Keene (1997) American Journal of Human Genetics 61, 273- 
278 and Keene (1999) Proceedings of the National Academy of Sciences (USA) 96, 

20 5-7. However, these reports are limited to in vitro applications, and do not describe 
in vivo methods for partitioning RNA into structural or functional subsets using RNA 
binding proteins. Although in vitro methods have been used to determine protein- 
RNA interactions, their use has certain limitations. Biochemical methods are 
generally reliable when carefully controlled, but RNA-binding can be problematic 

25 because many interactions may be of low affinity, low specificity or even artifactual. 
In order to understand RNA-protein interactions and their functional implications on a 
global systems level it is necessary to find reliable methods to monitor messenger 
RNP complexes in vivo. 

The successful immunoprecipitation of epitope-tagged ELAV/Hu protein 

30 which has been transfected into pre-neuronal cells has been reported. See Antic et 
al., Genes and Development 13, 449-461 (1999). This immunoprecipitation was 
followed by nucleic acid amplification that allowed for the identification of a 
messenger RNA encoding neurofilament M protein (NF-M). 

35 SUMMARY OF THE INVENTION 

The present invention relates to a new, in vivo approach for the determination 
of gene expression that utilizes the flow of genetic information through messenger 
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RNA clusters or subsets. Recently, the practice of examining multiple 
macromolecular events simultaneously and in parallel with the goal of organizing 
such information computationally has taken the designation "-ome." Thus, the 
genome identifies all of the genes of a cell, while the transcriptome is defined as the 
5 messenger RNA complement of the genome and the proteome is defined as the 
protein complement of the genome (see Figure 1). The present inventors have 
defined several physically organized subsets of the transcriptome and defined them 
as dynamic units of the "ribonome". As described herein, the ribonome consists of a 
plurality of distinct subsets of messenger RNAs (mRNAs) that are clustered in the 
10 cell due to their association with RNA-binding proteins (e.g., regulatory RNA-binding 
proteins). By identifying the mRNA components of a cellular ribonome, the cellular 
transcriptome can be broken down into a series of subprofiles that together can be 
used to define the gene expression state of a cell or tissue (see Figure 2). In 
combination with, for example, high throughput approaches and by multiplexing RNA 
15 processing assays, the present inventive methods provide the ability to determine 
the changes that occur in multiple gene transcripts simultaneously. 

Accordingly, one aspect of the invention is an in vivo method of partitioning 
endogenous, cellular mRNA-binding protein (mRNP) complexes. The method, in 
one embodiment, comprises contacting a biological sample that comprises at least 
20 one mRNP complex with a ligand that specifically binds a component of the mRNP 
complex. The biological sample may be, for example, a tissue sample, whole tissue, 
a whole organ, a cell culture, or a cell extract or lysate. The component of the mRNP 
complex may be a RNA binding protein, a RNA-associated protein, a nucleic acid 
associated with the mRNP complex including the mRNA itself, or another molecule 
25 or compound (e.g., carbohydrate, lipid, vitamin, etc.) that associates with the mRNP 
complex. The ligand may be, for example, an antibody that specifically binds the 
component, a nucleic acid that binds the component (e.g., an antisense molecule, a 
RNA molecule that binds the component), or any other compound or molecule that 
binds the component of the complex. The mRNP complex is then separated by 
30 binding the ligand (now bound to the mRNP complex) to a binding molecule that 
binds the ligand. The binding molecule may bind the ligand directly {i.e., may be an 
antibody specific for the ligand), or may bind the ligand indirectly {i.e., may be an 
antibody or binding partner for a tag on the ligand). The binding molecule will be 
attached to a solid support, such as a bead or plate or column, as known in the art. 
35 Accordingly, the mRNP complex will be attached to the solid support via the ligand 
and binding molecule. The mRNP complex is then collected by removing it from the 
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solid support (i.e., the complex is washed off the solid support using suitable 
conditions and solvents). 

The identity of the mRNA bound within the mRNP complex may then be 
determined, for example, by separating the mRNA from the complex, reverse 

5 transcribing the mRNA into cDNA, and sequencing the cDNA. 

In embodiments of the invention, therefore, the mRNP complex may be 
isolated by direct immunoprecipitation of the mRNP complexes, either with or without 
epitope tags, or by other biochemical partitioning methods. For example, other 
proteins bound to or associated with the mRNP complex may be immunoprecipitated 

10 in order to recover the mRNP complex and subsequently the mRNAs bound within 
the complex. The skilled artisan will appreciate that embodiments of the inventive 
method allow for the identification of a plurality of mRNA complexes simultaneously 
(/.e M concurrently), sequentially, or in batch-wise fashion. Alternatively, the method 
may be carried out on one biological sample (or portion thereof) numerous times, the 

15 steps of the method being performed in a sequential fashion, with each iteration of 
the method utilizing a different ligand. In any of the described embodiments, cDNA 
or genomic microarray grids, for example, may be used to identify mRNAs isolated 
by the inventive method en masse. 

A "subset" of mRNA is defined as a plurality of mRNA transcripts or 

20 messages that specifically bind or associate with a mRNP complex. In other words, 
subsets are defined by their ability to bind within or to a particular mRNP complex. 
The subset will preferably be a quantitative or qualitative fraction of the total mRNA 
population of the cell. Furthermore, subsets within subsets of mRNAs may be 
identified using the invention. The collection of mRNA subsets for any particular cell 

25 or tissue sample is an expression profile, also referred to herein as a "ribonomic 

profile," for that cell or tissue. It will be appreciated that expression profiles will differ 
from cell sample to cell sample, depending on the type of cell in the sample (e.g., 
what species or tissue type the cell is), the differentiation status of the cell, the 
pathogenicity of the cell {i.e., if the cell is infected or if it is expressing a deleterious 

30 gene, such as an oncogene, or if the cell is lacking a particular gene), the specific 
ligand used to isolate the mRNP complex, etc. Thus, the expression profile of a cell 
may be used as an identifier for the cell, enabling the artisan to compare and 
distinguish profiles of different cells. 

Stated otherwise, the ribonomic profile provides a pattern recognition subset 

35 of the global mRNA profile of the cell. When the growth state of the cells changes 
(i.e., tumorigenesis) or the cell is perturbed by a pathogen (i.e., a viral infection), the 
profile will change, and a perturbation of the ribonome can be detected. If cells are 
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treated with compounds {i.e., drugs) the ribonomic patterns will show desirable or 
undesirable alteration. Accordingly, the new method provides methods for 
evaluating the effect of numerous factors on a cell, including toxicity, aging, 
apoptosis, pathogenesis and cell differentiation. 

5 The new invention has several advantages over previous methods of 

partitioning RNA. First, partitioning of mRNP complexes may be carried out in vivo, 
while previous methods were limited to in vitro applications. The new method is 
robust enough such that amplification (e.g., by PCR, or alternatively according to the 
method of Antic et al. (1 999) Genes Dev. 1 3, 449-461 ) is not necessary to identify 

10 cDNAs of interest once they are reverse transcribed from the isolated subset of 

mRNAs. The present invention does not require the use of iterative processes, such 
as those set forth in Gao ef ai supra. Finally, quantitative determinations are 
possible with the present invention if, for example, hybridization is used to analyze 
the expression profile of the cell (e.g., in microarray assays or RNAse protection 

15 assays (RPA)). 

In certain embodiments, therefore, the present invention advantageously 
allows the artisan to identify, monitor, and quantitate mature gene transcripts en 
masse in order to determine their localization, activity, stability, and translation into 
protein components of living cells. The methods described herein advantageously 

20 provide a novel approach to functional genomics by providing methods of isolating 
endogenous messenger-RNA binding proteins, and methods of identifying the subset 
of cellular mRNAs contained in mRNP-complexes, using microarrays or other known 
procedures. In preferred embodiments, the inventive method provides a basis for 
investigating and determining functional mRNA networks during growth and 

25 differentiation cycles by using mRNA-binding proteins and other mRNP-associated 
factors to define mRNA subsets. 

It will be appreciated that patterns of mRNA subsets {i.e., expression profiles) 
may be altered in the presence of certain compounds {i.e., drugs) or under various 
disease conditions. Accordingly, in certain embodiments the inventive methods are 

30 useful for screening compounds that may be of therapeutic use, and for finding 
appropriate gene targets for the compounds. In other embodiments, the inventive 
method is useful for determining the disease state of a cell, thus providing means for 
classifying or diagnosing the presence or predisposition for disease (e.g., cancer). 

Gene expression profiles will also vary between differing ceil types present in 

35 a complex tissue, such as a tumor. Some mRNA binding proteins are present only in 
certain tumor cells, and a tumor may comprise more than one cell type. Gene 
expression profiling for each cell type within a tumor or tissue may be carried out by 
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making an extract of the tissue and immunoprecipitating cell-type specific 
components of mRNP complexes (e.g., RNA-binding proteins that are attached to 
mRNA) directly from the extract {i.e., in vivo). The immunoprecipitated pellets will 
contain mRNAs that are only present in the same cells that contain the attached or 

5 associated component. Thus, in certain embodiments, the inventive methods may 
be used to characterize and distinguish the gene expression profiles of a plurality of 
cell types, which cell types may co-exist in the same complex tissue. This can allow 
the tumor cells to be profiled in whole tumor extracts without having to analyze 
mRNA in, for example, the non-tumor stromal cells and blood cells that surround 

10 tumor cells. The results of such characterization may be useful in determining, for 
example, the proper course of treatment for a patient suffering with a tumor, when 
the choice of treatment depends of the kind of tissue (e.g., endothelial vascular 
tissue) present in a tumor. 

In another embodiment, the present invention provides methods for isolating 

15 and optionally identifying proteins that bind or associate with a mRNP complex. 

Alternatively, and in another embodiment, the inventive method may be used 
to screen test compounds for their ability to modulate gene expression in a cell. 
Such methods are useful for screening putative drugs that may be used in the 
treatment and/or prevention of disorders associated with irregularities in gene 

20 expression, including but not limited to cancer. 

The foregoing and other aspects of the present invention are explained in 
detail in the specification set forth below. 

BRIEF DESCRIPTION OF THE DRAWINGS 
25 Figure 1 is a schematic illustrating the flow of genetic information from the 

genome to the proteome, and the intermediate levels represented by the ribonome 
and the transcriptome. The transcriptome represents the total mRNA complement of 
the genome, but does not necessarily correlate directly with protein production. The 
processing, transport and translation of mRNA occurs in the ribonome, where 
30 dynamic regulatory steps determine the proteomic outcome. 

Figure 2 graphically illustrates a comparison of the total cell mRNA (the 
transcriptome) with mRNA that has bound within mRNP complexes to form a part of 
the ribonome. The microarrays representing mRNP complexes contain discrete and 
more limited subsets of mRNAs, when compared to the transcriptome. 
35 Figures 3A and 3B illustrate multi-probe RNase protection analysis of 

mRNAs associated with mRNP complexes. Messenger RNP complexes from cell 
lysates were immunoprecipitated, and the pelleted RNA was extracted and 
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quantitated by RNase protection. Figure 3A and 3B show examples of mMyc and 
mCyc-1 multi-probe template sets, respectively. Lanes: (1) undigested riboprobe 
(slightly larger than RNase-digested product due to riboprobe plasmid template); (2) 
total cellular RNA; (3) rabbit pre-bleed serum control; (4) mRNAs extracted from HuB 

5 mRNPs; (5) mRNAs extracted from PABP mRNPs. An asterisk (*) denotes mRNA 
species not detected in total RNA. 

Figure 4 illustrates ribonome profiling of RNA subsets using DNA arrays. 
The RNA-protein complexes can be derived from cells of two individuals, species, 
cell types, treatments, developmental stages, efc. mRNA-protein complexes are 

10 separated immunoprecipitations of complexes are conducted, probes are reverse- 
transcribed from the RNA template, and a DNA array of genes is probed with each 
pool of RNP-derived probes to generate subprofiles of gene expression (10). 
Subprofiles are then compared by subtraction or addition to generate an overall 
picture of gene expression (20). This figure depicts the ribonomic concept, in which 

15 different mRNPs are isolated and their associated mRNAs identified using 

microarrays. The subprofiles (mRNP1, mRNP2, ....mRNPn) of the total cell profile 
are shown as additive. Stacked mRNP subprofiles can each represent individual 
mRNPs within a single cell type, or can represent each individual cell's transcriptome 
within a complex tissue or tumor. 

20 Figure 5 sets forth the results of illustrative Example 4, below, and shows 

mRNAs associated with mRNP complexes using cDNA arrays. Panels: (A) pre- 
bleed; (B) HuB mRNP complexes; (C) elF-4E mRNP complexes; (D) PABP mRNP 
complexes; (E) total cellular RNA. An example of the specificity of the procedure is 
indicated by the differential abundance of the mRNAs encoding (3-actin and 

25 ribosomal protein S29 among the mRNP profiles (arrows a and b, respectively). 
Other examples of such specificity are readily observable with other mRNAs (data 
not shown). 

Figure 6 sets forth the results of illustrative Example 5, below, and shows a 
comparison of the mRNA profiles from HuB mRNPs before and after treatment with 

30 retinoic acid (RA). Panels: (A) mRNAs extracted from HuB mRNPs 

immunoprecipitated from untreated cells; (B) mRNAs extracted from HuB mRNPs 
immunoprecipitated from RA-treated cells; (C) a computer-generated comparison of 
panels A and B; (D) mRNAs extracted from HuA (HuR) mRNPs immunoprecipitated 
from untreated cells; (E) mRNAs extracted from HuA mRNPs immunoprecipitated 

35 from RA-treated cells; (F) a computer-generated comparison of panels D and E; (G) 
total complement of mRNAs extracted from untreated cellular lysate; (H) total 
complement of mRNAs extracted from RA-treated cellular lysate; and (I) a computer- 
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generated comparison of panels G and H. For panels C, F, and I: green bars 
indicate mRNAs of approximately equal abundance; red bars represent mRNAs from 
HuB mRNPs that were detectable at four-fold or greater following RA treatment; blue 
bars represent mRNAs from HuB mRNPs that were detectable four-fold or greater in 
5 cells before RA treatment. 

Figure 7 is a schematic of ribonomic profiling. 

Figure 8 is a schematic outlining a strategy for the identification of new RNA- 
binding proteins. 

10 DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

The present invention will now be described more fully with reference to the 
accompanying drawings, in which preferred embodiments of the invention are 
shown. This invention may, however, be embodied in different forms and should not 
be construed as limited to the embodiments set forth herein. Rather, these 

15 embodiments are provided so that this disclosure will be thorough and complete, and 
will fully convey the scope of the invention to those skilled in the art. 

The terminology used in the description of the invention herein is for the 
purpose of describing particular embodiments only, and is not intended to be limiting 
of the invention. As used in the description of the invention and the appended 

20 claims, the singular forms "a", "an" and "the" are intended to include the plural forms 
as well, unless the context clearly indicates otherwise. Unless otherwise defined, all 
technical and scientific terms used herein have the same meaning as commonly 
understood by one of ordinary skill in the art to which this invention belongs. All 
publications, patent applications, patents, and other references mentioned herein are 

25 incorporated by reference in their entirety. 

Except as otherwise indicated, standard methods may be used for the 
production of cloned genes, expression cassettes, vectors, and transformed cells 
and plants according to the present invention. Such methods are known to those 
skilled in the art. See e.g., J. Sambrook et a/., Molecular Cloning: A Laboratory 

30 Manual Second Edition (Cold Spring Harbor Laboratory, Cold Spring Harbor, New 
York, 1989); F. M. Ausubel et a/., Current Protocols In Molecular Biology (Green 
Publishing Associates, Inc. and Wiley-lnterscience, New York, 1991). 

Nucleotides and amino acids are represented herein in the manner 
recommended by the IUPAC-IUB Biochemical Nomenclature Commission, or (for 

35 amino acids) by three letter code, in accordance with 37 C.F.R. §1 .822 and 

established usage. See, e.g., Patentln User Manual, 99-102 (Nov. 1990) (U.S. 
Patent and Trademark Office). 
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Nucleotide sequences are presented herein by single strand only, in the 5' to 
3' direction, from left to right. "Nucleic acid sequence" as used herein refers to an 
oligonucleotide, nucleotide, or polynucleotide, and fragments thereof, and to DNA or 
RNA of genomic or synthetic origin which may be single- or double-stranded, and 

5 represent the sense or antisense strand. The term "nucleic acid" refers to 

deoxyribonucleotides or ribonucleotides and polymers thereof in either single- or 
double-stranded form. Unless specifically limited, the term encompasses nucleic 
acids containing known analogues of natural nucleotides which have similar binding 
properties as the reference nucleic acid and are metabolized in a manner similar to 

10 naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid 
sequence also implicitly encompasses conservatively modified variants thereof (e.g. 
degenerate codon substitutions) and complementary sequences, and as well as the 
sequence explicitly indicated. Two nucleic acids are "recombined" when sequences 
from each of the two or more nucleic acids are combined in a progeny nucleic acid. 

1 5 The terms "nucleic acid" or "nucleic acid sequence" may also be used in 

reference to genes, cDNA, and mRNA encoded by a gene. The term "gene" is used 
broadly to refer to any segment of DNA associated with a biological function. Thus, 
genes include coding sequences and/or the regulatory sequences required for their 
expression. Genes also include non-expressed DNA segments that, for example, 

20 form recognition sequences for other proteins. Genes can be obtained from a variety 
of sources, including cloning from a source of interest or synthesizing from known or 
predicted sequence information. 

As used herein, a nucleic acid molecule may be RNA (the term "RNA" 
encompassing all ribonucleic acids, including but not limited to pre-mRNA, mRNA, 

25 rRNA, hnRNA, snRNA and tRNA); DNA; peptide nucleic acid (PNA, as described in, 
e.g., U.S. Patent No.5,539,082 to Nielsen ef a/., and U.S. Patent No. 5,821 ,060 to 
Arlinghaus ef a/.); and the analogs and modified forms thereof. Preferably, the 
nucleic acid is RNA, and more preferably the nucleic acid molecule is messenger 
RNA (mRNA). Nucleic acid molecules of the present invention may be linear or 

30 circular, an entire gene or a fragment thereof, full-length or fragmented/digested, 

"chimeric" in the sense of comprising more than one kind of nucleic acid, and may be 
single-stranded or double-stranded. Nucleic acid from any source may be used in 
the present invention; that is, nucleic acids of the present invention include but are 
not limited to genomic nucleic acid, synthetic nucleic acid, nucleic acid obtained from 

35 a piasmid, cDNA, recombinant nucleic acid, and nucleic acid that has been modified 
by known chemical methods, as further described herein. Nucleic acids may also be 
products of in vitro selection experiments (also called aptamers) and other nucleic 
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acid molecules useful for their ability to bind or be bound by other ligands. See D. 
Kenan, TIBS 19, 57-64 (1994); L. Gold, et a/., Annu. Rev. Biochem. 64, 763-798 
(1995); S.E. Osborne and A.D. Ellington, Chem. Rev. 97, 349-370 (1997). 

Nucleic acids of the present invention may be obtained from any organism, 

5 including but not limited to bacteria, viruses, fungi, plants and animals, with animal 
nucleic acid being preferred, mammalian nucleic acid being more preferred, and 
human nucleic acid being most preferred. If desired, the nucleic acid may be 
amplified according to any of the known nucleic acid amplification methods that are 
well-known in the art (e.g., PCR, RT-PCR, QC-PCR, SDA, and the like). Nucleic 

10 acids of the present invention may be, and preferably are, purified according to 
methods known in the art. 

As summarized above, the present invention relates to in vivo methods for 
partitioning mRNP complexes from a cell. mRNP complexes of the present invention 
is preferably from a biological sample, such as a tissue sample, whole tissue, a 

15 whole organ (e.g., an entire brain, liver, kidney, etc.), bodily fluid sample, cell culture, 
cell lysate, cell extract or the like. In a preferred embodiment, the biological sample 
comprises or is obtained from a population of cells. By a "population of cells" herein 
is meant at least two cells, with at least about 10 3 being preferred, at least about 10 6 
being particularly preferred, and at least about 10 8 to 10 9 being especially preferred. 

20 The population or sample can contain a mixture of different cell types from either 
primary or secondary cultures, or from a complex tissue such as a tumor, or may 
alternatively contain only a single cell type. In a preferred embodiment, cells that are 
proliferating are used. Alternatively, non-proliferating cells may be used. 

Preferred cell types for use in the invention include, but are not limited to, 

25 mammalian cells, including animal (rodents, equines, bovines, canines, felines and 
primates), and humarvcells, with human cells being preferred. Cells from non- 
mammalian animals (e.g., avians, fish, reptiles) and from plants may also be used in 
the practice of the present invention. Cells may be tumor cells from tumors of any 
type, including breast, skin, lung, cervix, colorectal, and brain/CNS tumors, etc. 

30 Additionally, non-cancerous cells from any organ may be used, including liver cells, 
neurons, muscle cells, and the like. 

mRNA is referred to herein interchangeably as a "message" or a "transcript". 
A "subset" of mRNA is defined as a plurality of mRNA that specifically binds within a 
particular mRNA binding protein complex (mRNP complex). Thus, subsets are 

35 defined by their ability to bind within or to a particular mRNP complex. The mRNA 
subset will preferably be a fraction of the total mRNA population of the cell. 
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As summarized above, one aspect of the invention is an in vivo method of 
partitioning endogenous cellular mRNA-binding protein (mRNP) complexes. 
"Endogenous" is used herein to mean that the mRNP complex forms in a cell {i.e., in 
vivo or in situ). The mRNP complex may form in the cell naturally, i.e., the 

5 components of the mRNP complex naturally occur in the cell and form the mRNP 
complex. Alternatively, the mRNP complex forms in a cell, even though one or more 
components of the complex is introduced into the cell by, e.g., infection or 
transformation. For example, a mRNP complex endogenously forms in a cell when 
a RNA-binding protein that is a component of the mRNP complex is ectopically 

10 expressed in the cell by (for example) transforming the cell or infecting the cell with 
an expression vector that carries nucleic acid encoding the protein, and a mRNP 
complex in which the protein binds is formed. 

The method, in one embodiment, comprises contacting a biological sample 
that comprises at least one mRNP complex with a ligand that specifically binds a 

15 component of the mRNP complex. The component of the mRNP complex may be a 
RNA binding protein, a RNA-associated protein, a nucleic acid associated with the 
mRNP complex including the mRNA itself, or another molecule or compound (e.g., 
carbohydrate, lipid, vitamin, etc.) that associates with the mRNP complex. A 
component "associates" with a mRNP complex if it binds or otherwise attaches to the 

20 mRNP complex with a Kd of about 1CT 6 to about 10" 9 . In a preferred embodiment, the 
component associates with the complex with a Kd of about 10" 7 to about 10' 9 . In a 
more preferred embodiment, the component associates with the complex with a Kd 
of about lO^to about 10~ 9 . 

The ligand may be any molecule that specifically binds the component of the 

25 mRNP complex. For example, the ligand may an antibody that specifically binds the 
component, a nucleic acid that binds the component (e.g., an antisense molecule, a 
RNA molecule that binds the component), or any other compound or molecule that 
specifically binds the component of the complex. In certain embodiments, the ligand 
may be obtained by using the serum of a subject (i.e., a human or animal subject) 

30 that has a disorder known to be associated with the production of mRNP-complex 
specific antibodies or proteins. Examples of these disorders include autoimmune 
disorders such as systemic lupus erythematosus ("lupus" or SLE) and a number of 
cancers. In certain embodiments, the ligand may be "tagged" with another 
compound or molecule in order to facilitate the separation, observation or detection 

35 of the ligand. In one embodiment of the invention, the ligand is "epitope tagged," as 
described in the art. Suitable tags are known in the art and include but are not 
limited to biotin, the MS2 protein binding site sequence, the UlsnRNA 70k binding 
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site sequence, the UlsnRNA A binding site sequence, the g10 binding site sequence 
(commercially available from Novagen, Inc., Madison, Wisconsin, USA), and FLAG- 
TAG® (Sigma Chemical, St. Louis, Missouri, USA). 

The mRNP complex is then separated by binding the ligand (now bound to 

5 the mRNP complex) to a binding molecule that specifically binds the ligand. The 
binding molecule may bind the ligand directly (i.e., may be an antibody or protein 
specific for the ligand), or may bind the ligand indirectly (i.e., may be an antibody or 
binding partner for a tag on the ligand). Suitable binding molecules include but are 
not limited to protein A, protein G, streptavidin. Binding molecules may also be 

10 obtained by using the serum of a subject suffering from, for example, an autoimmune 
disorder or cancer. In certain embodiments, the ligand is an antibody that binds the 
component of the mRNP complex via the Fab region of the antibody, and the binding 
molecule in turn binds the Fc region of the antibody. The binding molecule will be 
attached to a solid support, such as a bead, well, pin, plate or column, as known in 

15 the art. Accordingly, the mRNP complex will be attached to the solid support via the 
ligand and binding molecule. 

The mRNP complex is then collected by removing it from the solid support 
(i.e., the complex is washed off the solid support under appropriate stringency 
conditions, using suitable solvents that may be determined by skilled artisans). 

20 In certain embodiments of the invention, the mRNP complex may be 

stabilized by cross-linking prior to binding the ligand thereto. Cross-linking, as used 
herein, means covalently binding (e.g., covalently binding the components of the 
mRNP complex together). Cross-linking may be contrasted with ligand-target 
binding, or binding molecule-ligand binding, which is generally non-covalent binding. 

25 Cross-linking may be carried out by physical means (e.g., by heat or ultraviolet 
radiation), or chemical means (e.g., by contacting the complex with formaldehyde, 
paraformaldehyde, or other known cross-linking agents), which means are known or 
determinable by those skilled in the art. In other embodiments, the ligand may be 
cross-linked to the mRNP complex after binding the mRNP complex. In additional 

30 embodiments, the binding molecule may be cross-linked to the ligand after binding to 
the ligand. In yet other embodiments, the binding molecule may be cross-linked to 
the solid support. 

The skilled artisan will appreciate the inventive method allows for the 
identification of a plurality of mRNP complexes simultaneously (e.g., "en masse"). 
35 For example, a biological sample may be contacted with a plurality of ligands specific 
for different mRNP complex components. A plurality of mRNP complexes from the 
sample will bind the various ligands. The plurality of mRNP complexes can then be 
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separated using appropriate binding molecules, thus isolating the plurality of mRNP 
complexes. The mRNP complexes and the mRNAs contained within the complexes 
may then be characterized and/or identified by methods described herein and known 
in the art. Alternatively, the method may be carried out on one sample numerous 
5 times, the inventive steps being performed in a sequential fashion, with each iteration 
of steps utilizing a different ligand. 

As set forth above, a subset of mRNA identifies a pattern-recognition profile 
that is characteristic of the RNA structural or functional networks in that sample. The 
collection of mRNA subsets for any particular cell or tissue sample constitutes a 

10 gene expression profile, and more specifically a ribonomic gene expression profile, 
for that cell or tissue. It will be appreciated that ribonomic expression profiles may 
differ from cell to cell, depending on the type of cell in the sample (e.g., what species 
or tissue type the cell is), the differentiation status of the cell, the viability of the cell 
(i.e., if the cell is infected or if it is expressing a deleterious gene, such as an 

15 oncogene, or if the cell is lacking a particular gene or not expressing a particular 
gene), the specific ligands used to isolate the mRNP complexes, etc. Thus, the 
ribonomic expression profile of a cell may be used as an identifier for the cell, 
enabling the artisan to compare and distinguish profiles or subprofiles of different 
cells. The genes identified by the RNAs present in each ribonomic pattern form 

20 distinct subsets that may be associated with a particular cell cycle, stage of 
differentiation, apoptosis or stress induction, viral infection, or cancer. 

cDNAs may be used to identify mRNP complexes partitioned with a ligand or 
ligands specific for a component of the mRNP complex. cDNA microarray grids, for 
example, may be used to identify mRNA subsets en masse. Microarrays are 

25 precisely aligned grids in which each target nucleic acid (e.g., gene) has a position in 
a matrix of carefully spotted cDNAs. See Gerhold et a/, supra, Duggan et a/, supra, 
and Brown et a/., supra. Alternatively, genomic microarrays (e.g., microarrays 
wherein the target nucleic acids may contain introns and exons) may be used. 
Therefore, each gene or target nucleic acid being examined on a microarray has a 

30 precise address that can be located, and the binding can be quantitated. 

Microarrays in the form of siliconized chips or those based upon cDNA blots on nylon 
or nitrocellulose are commercially available. Glass slides can also be customized 
with oligonucleotides or DNAs for detection of complementary RNA sequences. In 
all of these cases, the hybridization platforms allow identification of the mRNAs in a 

35 sample based upon the stringency of binding and washing. This has been referred to 
as "sequencing by hybridization." Although microarray technology is one method of 
analysis, it is only one way to identify and/or sequence the mRNAs in the mRNA 
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subset. Alternative approaches include but are not limited to differential display, 
phage display/analysis, SAGE or simply preparing cDNA libraries from the mRNA 
preparation and sequencing all members of the library. 

Methods for DNA sequencing which are well known and generally available in 

5 the art may be used to practice any of the embodiments of the invention. The 

methods may employ such enzymes as the Klenow fragment of DNA polymerase I, 
SEQUENASE® (US Biochemical Corp, Cleveland, Ohio), Taq polymerase (Perkin 
Elmer), thermostable T7 polymerase (Amersham, Chicago, IIL), or combinations of 
polymerases and proofreading exonucleases such as those found in the ELONGASE 

10 Amplification System marketed by Gibco/BRL (Gaithersburg, Md.). Preferably, the 
process is automated with machines such as the Hamilton Micro Lab 2200 (Hamilton, 
Reno, Nev.), Peltier Thermal Cycler (PTC200; MJ Research, Watertown, Mass.) and 
the ABI Catalyst and 373 and 377 DNA Sequencers (Perkin Elmer). 

In a preferred embodiment, amplification of the mRNA isolated according to 

15 the present invention, and/or the cDNA obtained from the mRNA is not carried out 
during the identification of the nucleic acid, and is not necessary or required by the 
present invention. However, the skilled artisan may choose to amplify the nucleic 
acid that is the subject of identification (e.g., the nucleic acid being identified via 
microarray analysis and/or sequencing) for convenience, as a matter of preference, 

20 and/or to comply with the specification/instructions of certain commercially available 
microarrays or microarray analysis systems. Thus, if desired, the nucleic acid may 
be amplified according to any of the numerous known nucleic acid amplification 
methods that are well-known in the art (e.g., PCR, RT-PCR, QC-PCR, SDA, and the 
like). 

25 Methods of the present invention may be carried out in several ways, 

according to the needs of the practitioner and the purpose for which the invention is 
carried out. For example, in one embodiment, mRNA-binding protein complexes that 
are unique to a cell type of interest are identified. In an example of such an 
embodiment, an antibody that is specific for the mRNP complex can be used to 

30 immunoprecipitate the complex with its associated mRNAs. The RNAs may then 
identified to form the ribonomic expression profile of that cell type, or alternatively 
may be isolated for (as an example) drug screening. The mRNA candidates for post- 
transchptional regulation may be analyzed en masse, as a subset, for changes in 
mRNA stability during the cell cycle or developmental events. In certain 

35 embodiments, the methods may be carried out by isolating nuclei from cells 

undergoing developmental or ceil cycle changes, performing nuclear run-off assays 
according to known techniques to obtain transcribing mRNAs, and then comparing 
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the transcribing mRNAs with the global mRNA levels in the same cells using cDNA 
microarrays. These methods thus provide the ability to distinguish transcriptional 
from post-transcriptional effects on steady state mRNA levels en masse. 

In another embodiment, cells in culture are transformed to express a RNA- 

5 binding protein (RBP) or RNA-associated protein (RAP) that will associate with particular 
mRNAs only in a cell type of interest. DNA encoding the RBP or RAP may be carried by 
a recombinant vector (e.g., a plasmid, a viral vector) and transformed into the cell by 
known means, after which the RBP or RAP is expressed in the cell. Any RBP or RAP 
can be used, as described further herein. The protein may be in its native form, or it may 

10 be tagged (e.g., epitope tagged) for easy recovery from the cell. Detection of multiple 
RNA targets in vivo that are bound or associated with RBPs or RAPs may be carried out 
by using accessible epitopes, if necessary, but preferably is carried out without tags. In 
cases where the epitopes on the RBPs or RAPs are inaccessible or obscured, epitope 
tags on ectopically expressed recombinant proteins may be used. The transformed cell 

15 may be mixed with other cell types or may be implanted in an animal or human subject. 
A ligand (e.g., an antibody) that is specific for the protein can used to immunoprecipitate 
the protein with its associated messenger RNAs from an extract of a tissue containing 
the transformed cell. The mRNA complexes and its associated RNAs may then 
identified to form the expression profile of that cell type or is otherwise analyzed (e.g., for 

20 drug development). 

In still another embodiment, a specific cell type in an animal is engineered with 
one or more cell-type specific gene promoters to express a RBP or RAP in the cell type 
of interest. TKs set forth above, the gene promoter and the RBP or RAP may be carried 
on one or more vectors and transformed into the cell, where the RBP or RAP is 

25 expressed. In one embodiment, a ligand (e.g., an antibody) that is specific for this 
protein can used to immunoprecipitate the protein with its attached or associated 
messenger RNAs from an extract of a tissue containing the cell type of interest. The 
RNAs are then identified to form the expression profile of that cell type or isolated, e.g., 
for drug development. 

30 RNA binding proteins (RBPs) and RNA-associated proteins (RAPs) useful in 

the practice of the present invention are known in the art, or may alternatively be 
identified and discovered by methods described herein. RNA binding proteins are 
now known to be involved in the control of a variety of cellular regulatory and 
developmental processes, such as RNA processing and compartmentalization, RNA 

35 stabilization, mRNA translation and viral gene expression. RNA binding proteins 
include poly A-binding protein ("PABP," which gives rise to a subset of the total 
mRNA population that is quantitatively different from the total mRNA population), and 
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other general RNA binding proteins, as well as RNA-binding proteins that are 
attached to only one or a few messenger RNAs in a particular cell type. Other useful 
proteins are autoantibodies reactive with RNA and RNA-binding proteins. 
Examples of useful RNA binding proteins include the four ELAV/Hu 
5 mammalian homologues of the Drosophila ELAV RNA-binding protein (Good (1995) 
Proc. Natl. Acad. Sci. USA 92, 4557-4561; Antic and Keene, supra. HuA (HuR) is 
ubiquitously expressed while HuB, HuC and HuD (and their respective alternatively- 
spliced isoforms) are predominantly found in neuronal tissue, but can also be 
expressed as tumor cell-specific antigens in some small cell carcinomas, 

10 neuroblastomas, and medulloblastomas (reviewed in Keene (1999) Proc. Natl. Acad. 
Sci. USA 96, 5-7). All Hu proteins contain three RNA-recognition motifs (RRMs), 
which confer their binding specificity for AREs (Antic and Keene, supra] Kenan et al. 
(1991) Trends Biochem. Sci. 16, 214-220; Burd and Dreyfuss (1994) Science 265, 
615-621). The evidence for ARE binding by Hu proteins began with the identification 

15 of an AU-rich binding consensus sequence from a randomized combinatorial RNA 
library that was screened with recombinant HuB (Levine et al. (1993) MoL Cell Biol. 
13, 3494-3504; Gao et al. (1994) Proc. Natl. Acad. Sci. USA 91, 11207-11211). 
These and other studies demonstrated that Hu proteins bind in vitro to several ARE- 
containing ERG mRNAs including c-myc, c-fos, GM-CSF and GAP-43 (Levine et al. 

20 (1993) Mol. Cell Biol. 13, 3494-3504; Gao et al. (1994) Proc. Natl. Acad. Sci. USA 
91, 11207-11211; King ef al. (1994) J. NeuroscL 14, 1943-1952; Liu et al. (1995) 
Neurology 45, 544-550; Ma et al (1996) J. Biol. Chem. 271, 8144-8151; Abe et al. 
(1996) Nucleic Acids Res. 24, 2011-2016; Chung et al. (1997) J. Biol. Chem. 272, 
6593-6598; Fan and Steitz (1998) EMBO J. 17, 3448-3460; Antic ef al. (1999) Genes 

25 Dev. 13,449-461). 

The binding of Hu proteins to ARE-containing mRNAs can result in the 
stabilization and increased translatability of the mRNA transcripts (Jain et al. (1997) 
Mol. Cell Biol. 17, 954-962; Levy ef al. (1998) J. Biol. Chem. 273, 6417-6423; Fan 
and Steitz (1998) EMBO J. 17, 3448-3460; Peng et al. (1998) EMBO J. 17, 3461- 
j 30 3470). The neuron-specific Hu proteins are one of the earliest neuronal markers 

i 

produced in teratocarcinoma cells following retinoic acid (RA)-treatment to induce 
neuronal differentiation (Antic ef a/., supra] Gao and Keene (1996) J. Cell Sci. 109, 
579-589). 

In one embodiment, the ligand used to carry out the invention is a RNA 
35 binding protein selected from the RNA Recognition Motif (RRM) family of cellular 

proteins involved in pre-messenger RNA processing. One example of such a protein 
is the U1A snRNP protein. More than 200 members of the RRM superfamily have 
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been reported to date, the majority of which are ubiquitously expressed and 
conserved in phylogeny (Query et al, Ce//(1989) 57: 89-101; Kenan et al, Trends 
Biochem. Sci. (1991) 16: 214-220). Most are known to have binding specificity for 
polyadenylate mRNA or small nuclear ribonucleic acids (e.g. U1, U2, etc.) transfer 

5 RNAs, 5S or 7S RNAs. They include but are not limited to hnRNP proteins (A, B, C, 
D, E, F, G, H, l f K, L), RRM proteins CArG, DT-7, PTB, K1, K2, K3, HuD, HUC, rbp9, 
elF4B, sxl, tra-2, AUBF, AUF, 32KD protein, ASF/SF2, U2AF, SC35, and other 
hnRNP proteins. Tissue-specific members of the RRM family are less common, 
including IMP, Bruno, AZP-RRMI, X16 which is expressed in pre-B cells, Bj6 which is 

10 a puff-specific Drosophila protein and ELAV/Hu, which are neuron specific. 

RNA-binding and RNA-associated proteins useful in the practice of the 
present invention include those isolated using autoimmune and cancer patient sera. 
A non-comprehensive list of RNA-binding and RNA-associated proteins useful in the 
practice of the present inventions is set forth below in Table 1. 

15 

Table 1 

RNA Binding and RNA Associated Proteins 



SLBP 


DAN 


TTP 


Hel-N1 


Hel-N2 


elF-4A 


elF-4B 


elF-4G 


elF-4E 


elF-5 


elF-4EBP 


MNK1 


PABP 


p62 


KOC 


p90 


La 


Sm 


Ro 


U1-70K 


AUF-1 


RNAse-L 


GAPDH 


GRSF 


Ribosomal Po,P1,P2/L32 


PM-Sci 


FMR 


Stauffen 


Crab 95 


TIA-1 


Upf1 


RNA BP1 


RNA BP2 


RNA BP3 


CstF-50 


NOVA-1 


NOVA-2 


CPEBP 


GRBP 


SXL 


SC35 


U2AF 


ASF/SF2 


ETR-1 


IMP-1 


IMP-2 


IMP-3 


ZBP 


LRBP-1 


Barb 


PTB 


uPAmRNA BP 


BARB1 


BARB2 


GIFASBP 


CYP mRNA BP 


IRE-BP 



/ 
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Table 1, continued. 



p50 


RHA 


FN mRNA BP 


AUF-1 


GA mRNA BP 


Vigillin 


ERBP 


CRD-BP 


HuA 


HuB 


HuC 


HuD 


hnRNPA 


hnRNP B 


hnRNP C 


hnRNP D 


hnRNP E 


hnRNP F 


hnRNP G 


hnRNP H 


hnRNP K 


hnRNP L 


U2AF 





The identification of new (i.e., novel, previously unknown) RNA-binding 
proteins (RBPs) and RNA associated proteins, is another aspect of the invention. 
5 Thus, in one embodiment of the invention, a RNA of interest (depicted in Figure 8 as 
"RNA Y") is used as a "bait" to trap a new RBP or RAP. In a preferred embodiment, 
RNA Y is first converted to a cDNA using standard molecular biology techniques and 
is subsequently ligated at the 3' or 5' end to another fragment of DNA (referred to 
herein as the "tagging DNA") that encodes a sequence (e.g., a RNA) that will bind a 

10 ligand of the present invention (the ligand being illustrated as protein "X" in Figure 
8). In other words, the tagging DNA encodes a binding partner of the ligand. Useful 
ligands may, in some embodiments, be obtained from (i.e., by using) the serum of a 
subject (i.e., a human or animal subject) that has a disorder known to be associated 
with the production of mRNP-complex specific antibodies or proteins, including 

15 autoimmune disorders and cancer. Useful binding partners include antibodies to the 
ligand. 

The resulting DNA chimera is fused to a promoter in an expression vector 
(e.g., a plasmid) and expressed in living cells (e.g., in a cell culture) to produce a 
RNA fusion molecule. In an alternative embodiment, the expression vector is 

20 infected into the cells by a virus, preferably a recombinant virus. A cell-free extract 
from the culture is prepared and contacted with the ligand (e.g., protein X) which has 
been immobilized on a solid support. After an incubation period, the ligand and the 
attached/associated RNA fusion molecule and its associated RBPs or RAPs are 
washed to remove residual cellular material. After the wash step, the RBPs or RAPs 

25 are removed from the RNA-protein complex and analyzed (e.g., sequenced using 
standard methods of microsequencing). 

Once partial protein sequence is obtained, the corresponding gene may be 
identified from known databases containing cDNA and genomic sequences. 
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Preferably, the gene is isolated, the protein is expressed, and an antibody is 
generated against the recombinant protein using known techniques. The antibodies 
are then used to recover and confirm the identity of the endogenous RBP or RAP. 
Subsequently, the antibody can be used for ribonomic analysis (see examples 

5 below) to determine the subset of cellular RNAs that cluster with (i.e., associate with) 
RNA Y. Furthermore, the RBP or RAP may be tested for its ability to regulate the 
translation of the protein encoded by RNA Y, and may be tested for validation as a 
drug target. Likewise, proteins encoded by the cellular RNAs that cluster with RNA Y 
may be tested for validation as drug targets, as further described herein. 

10 Antibodies that specifically bind mRNP complexes are thus an aspect of the 

invention. Antibodies to mRNP complexes may be generated using methods that are 
well known in the art. Such antibodies may include, but are not limited to, polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and fragments produced by a 
Fab expression library. Antibodies and fragments thereof may also be generated 

15 using antibody phage expression display techniques, which are known in the art. 

For the production of antibodies, various hosts including goats, rabbits, rats, 
mice, humans, and others, may be immunized by injection with the mRNP complex 
or any fragment or component thereof which has immunogenic properties. 
Depending on the host species, various adjuvants may be used to increase an 

20 immunological response. Such adjuvants include, but are not limited to, Freund's, 
mineral gels such as aluminum hydroxide, and surface active substances such as 
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanin, and dinitrophenol. Among adjuvants used in humans, BCG (Bacilli 
Calmette-Guerin) and Corynebacterium parvum are especially preferable: 

25 Monoclonal antibodies to the components of the mRNP complex may be 

prepared using any technique which provides for the production of antibody 
molecules by continuous cell lines in culture. These include, but are not limited to, 
the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
hybridoma technique (Kohler, G. etal. (1975) Nature 256:495-497; Kozbor, D. etai 

30 (1985) J. Immunol. Methods 81 :31-42; Cote, R. J. ef a/. (1983) Proc. Natl. Acad. Sci. 
80:2026-2030; Cole, S. P. ef a/. (1984) Mol. Cell Biol. 62:109-120). Briefly, the 
procedure is as follows: an animal is immunized with the mRNP complex or 
immunogenic fragments or conjugates thereof. Lymphoid cells (e.g. splenic 
lymphocytes) are then obtained from the immunized animal and fused with 

35 immortalizing cells (e.g. myeloma or heteromyeloma) to produce hybrid cells. The 
hybrid cells are screened to identify those which produce the desired antibody. 
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Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in the literature (Orlandi, R. et a/., Proc. Natl. 
Acad. Sci. 86, 3833-3837 (1989)); Winter, G. et a/., (1991) Nature 349, 293-299 
5 (1991)). 

Antibody fragments that contain specific binding sites for mRNP complexes 
may also be generated. For example, such fragments include, but are not limited to, 
the F(ab') 2 fragments which can be produced by pepsin digestion of the antibody 
molecule and the Fab fragments which can be generated by reducing the disulfide 
10 bridges of the F(ab') 2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments with 
the desired specificity (Huse, W. D. etal. (1989) Science 254:1275-1281). 

Various immunoassays may be used for screening to identify antibodies 
having the desired specificity for the mRNP. Numerous protocols for competitive 
15 binding or immunoradiometric assays using either polyclonal or monoclonal 
antibodies with established specificities are well known in the art. Such 
immunoassays will typically involve the measurement of complex formation between 
the component of the mRNP complex and its specific antibody. A two-site, 
monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two non- 
20 interfering epitopes is preferred, but a competitive binding assay may also be 
employed. 

Kits or devices (e.g., fluidic devices) containing columns in which antibodies 
to various mRNP complexes or components thereof (e.g., antibodies to RNA-binding 
proteins) are immobilized are another aspect of the invention. The antibodies may 

25 be conjugated to a solid support suitabie for a diagnostic assay (e.g., beads, plates, 
slides or wells formed from materials such as latex or polystyrene) in accordance 
with known techniques, such as precipitation. Antibodies may likewise be conjugated 
to detectable groups such as radiolabels (e.g., 35 S, 125 l, 131 l), enzyme labels (e.g., 
horseradish peroxidase, alkaline phosphatase), and fluorescent labels (e.g., 

30 fluorescein) in accordance with known techniques. Devices of the present invention 
will preferably include at least one reagent specific for detecting the binding between 
an antibody to a mRNA-binding protein and the protein itself. The reagents may also 
include ancillary agents such as buffering agents and protein stabilizing agents, e.g., 
polysaccharides and the like. The device may further include, where necessary, 

35 agents for reducing background interference in a test, control reagents, apparatus for 
conducting a test, and the like. The device may be packaged in any suitable manner, 
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typicaliy with all elements in a single container along with a sheet of printed 
instructions for carrying out the test. 

Certain embodiments of the invention relate to methods of screening test 
compounds for therapeutic, diagnostic or pharmaceutical use, based upon each 

5 compound's effect on the ribonomic profile of a cell or tissue sample. In an example 
of such an embodiment, cells are grown under conditions where the cells have no 
contact with the test compound (/.e M the cells are not treated). A ribonomic profile of 
the cell type is then produced, and the mRNA subsets identified. The ribonomic 
profile of the non-treated cells is then compared to a ribonomic profile of the same 

10 cell type that has been treated with the test compound. Any difference between the 
two profiles is an indication that the test compound has an effect (directly or 
indirectly) on the expression of certain genes of the cell, and may be an indication 
that the test compound is a candidate for therapeutic or diagnostic use. 
Alternatively, the ability of a compound to effect gene expression may identify the 

15 gene as a target for further testing. A "difference" in the profiles refers to any 

modulation or change in expression between the two profiles. "Modulation" can refer 
to an increase in expression, a decrease in expression, a change in the type or kind 
of expression present, a complete cessation of expression {i.e., an absence of 
expression), or the instigation of expression. Suitable compounds that may be used 

20 include but are not limited to proteins, nucleic acids, small molecules, hormones, 
antibodies, peptides, antigens, cytokines, growth factors, pharmacological agents 
including chemotherapeutics, carcinogenics, or other cells {I.e. cell-cell contacts). 
Cells may also be screened for the effects of environmental or physiological factors 
such as radiation, action potentials, etc. on normal gene expression. 

25 In another embodiment of the invention, an mRNP component itself, its 

catalytic or immunogenic fragments or oligopeptides thereof, can be used for 
screening libraries of compounds in any of a variety of drug screening techniques. 
The fragment employed in such screening may be free in solution, affixed to a solid 
support, borne on a cell surface, or located intracellular^. The binding between the 

30 mRNP complex and the compound being tested may be measured. 

Another technique for drug screening which may be used provides for high 
throughput screening of compounds having suitable binding affinity to the protein of 
interest as described in published PCT application WO84/03564. In one 
embodiment, as applied to the mRNP complex, a plurality of different test 

35 compounds are synthesized on or affixed to a solid substrate, such as plastic pins or 
some other surface. The test compounds are reacted with the mRNP complex, or 



-23- 

fragments and/or components thereof, and washed. The bound mRNP complex or 
component thereof is then detected by methods well known in the art. 

In summary, the present invention provides powerful in vivo methods for 
determining the ribonomic profile of a cell and detecting changes in the same. The 

5 invention has numerous uses, including but not limited to the monitoring of tumor 
development, state of growth or state of development, perturbations of a biological 
system such as disease, drug or toxin treatment, and the state of cell aging or death. 
The invention also finds use in distinguishing ribonomic profiles amongst organisms 
such as plant, fungal, bacterial, viral, protozoan, or animal species. 

10 The present invention can be used to discriminate between transcriptional 

and post-transcriptional contributions to gene expression and to track the movement 
of RNAs through RNP complexes, including the interactions of combinations of 
proteins with RNAs in RNP complexes. Accordingly, the present invention can be 
used to study the regulation of stability of RNAs. The present invention can be used 

15 to investigate the activation of translation of mRNAs as single or multiple species by 
tracking the recruitment of mRNAs to active polysomes, measuring the sequential, 
ordered expression of mRNAs, and measuring the simultaneous, coordinate 
expression of multiple mRNAs. The present invention can also be used to determine 
the trans-acting functions of RNAs themselves upon contacting other cellular 

20 components. These and numerous other uses will be made apparent to the skilled 
artisan upon study of the present specification and claims, 

The following Examples are set forth to illustrate the present invention, and are 
not to be construed as limiting thereof. 

25 EXAMPLE 1 

RNase Protection in a Multiprobe System: Materials and Methods 

It has previously been reported that HuB (Hel-N1) immunoprecipitation, using 

a g10 epitope tag, resulted in the co-immunoprecipitation of a mRNA, which once 

amplified by RT-PCR and sequenced, was found to encode NF-M protein (Antic, 

30 1 999, supra). In this example, the same approach is expanded to using a multiprobe 
RNase protection assay to rapidly optimize the immunoprecipitation of several 
endogenous mRNA-protein (mRNP) complexes containing different mRNA-binding 
proteins. In the multi-probe system, many mRNAs, from mRNP pellets, can be 
assayed in a single lane of polyacrylamide gel. 

35 Cell Culture and Transformation. Murine P1 9 embryonal carcinoma cells 

were obtained from the ATCC and maintained in monolayer culture using ct-MEM 
without phenol red (Gibco-BRL 41061-029) supplemented with 7.5% Bovine Calf 
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Serum, 2.5% Fetal Bovine Serum (Hyclone) and 100U Penicillin/Streptomycin. Cells 
were grown in tissue culture flasks or plates that had been pre-coated with 0.1% 
gelatin (Sigma Chemicals) that was removed prior to use. Monolayer cell cultures 
were maintained in 5% C0 2 at 37°C. 

5 P19 cells were stably transfected with a SV40 promoter-driven pAlpha2- 

gene10-HuB plasmid that ectopically expressed a genelO-tagged neuron-specific 
HuB protein termed Hel-N2 (Gao etal. (1994) Proc. Natl. Acad. Sci. USA 91, 11207- 
11211). The transfected plasmid was maintained by supplementing the medium with 
0.2 mg/ml G418 (Sigma Chemicals). Although it lacks thirteen amino acids from the 

10 hinge region connecting RNA-recognition motifs (RRMs) 2 and 3 of Hel-N1 , the 

RRMs are identical and in vitro binding experiments have indicated no differences in 
the AU-rich RNA binding properties of Hel-N1 and Hel-N2 (Gao et al. (1994) Proc. 
Natl. Acad. Sci. USA 91, 11207-11211; Abe et al. (1996) Nucleic Acids Res. 24, 
2011-2016; unpublished results). 

15 Antibodies. Monoclonal anti-gene 10 (g10) antibodies were produced as 

previously described (Gao et al. (1994) Proc. Natl. Acad. Sci. USA 91, 1 1207-1 121 1 ; 
Antic ef al. (1999) Genes Dev. 13, 449-461). Polyclonal sera reactive with HuA were 
produced as previously described (Levine etal. (1993) Mol. Cell. Biol. 13, 3494-3504; 
Atasoy et al. (1998) J. Cell Sci. 111,3145-3156). Antibodies reactive with Poly A 

20 binding protein (PABP) were kindly provided by Dr. N. Sonenberg of McGill 
University (Canada). 

Preparation of Cell-Free Extracts. Cells were removed from tissue culture 
plates with a rubber scraper and washed with cold PBS. The cells were 
resuspended in approximately two pellet volumes of polysome lysis buffer (PLB) 

25 containing 100 mM KCI, 5 mM MgCL 2l 10 mM HEPES pH 7.0, and 0.5% NP-40 with 
1 mM DTT, 100 U/mL RNase OUT (GIBCO-BRL), 0.2% vanadyl ribonucleoside 
complex (VRC)(GIBCO-BRL), 0.2 mM PMSF, 1 mg/mL pepstatin A, 5 mg/mL 
bestatin, and 20 mg/mL leupeptin added fresh at the time of use. The lysed celis 
were then frozen and stored at -100°C. At the time of use, the cell lysate was thawed 

30 and centrifuged at 1 2,000 rpm in a tabletop microfuge for 10 min at 4°C. The 

supernatant was removed and centrifuged a second time at 16,000 rpm in a tabletop 
microfuge for 5 min at 4°C before being stored on ice or refrozen at -100°C. The 
mRNP cell lysate contained approximately 30-50 mg/mL total protein. 

Immunoprecipitations. For immunoprecipitation, Protein A sepharose beads 

35 (Sigma Biochemicals) were swollen 1 :5 v/v in NT2 buffer (50 mM Tris pH 7.4, 1 50 
mM NaCI, 1 mM MgCI 2 , and 0.05% NP-40) supplemented with 5% bovine serum 
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albumin. A 300 aliquot of 1:5 v/v pre-swollen Protein A bead slurry was used per 
immunoprecipitation reaction and incubated overnight at 4°C with excess 
immunoprecipitating antibody (typically 5-20 jiL, depending on the reagent). The 
antibody-coated Protein A beads were washed 5 times with ice-cold NT2 buffer and 
5 resuspended in 900 jiL of NT2 buffer supplemented with 100 U/mL RNase OUT, 
0.2% VRC, 1 mM DTT, and 20 mM EDTA. The beads were briefly vortexed and 100 
|iL of the mRNP cell lysate was added. The beads were immediately centrifuged 
and 100 jaL of the supernatant was removed to represent total cell mRNP lysate 
(essentially one-tenth the quantity of lysate used in the mRNP 

10 immunoprecipitations). The immunoprecipitation reaction and an aliquot removed to 
represent total cell mRNP lysate were tumbled at room temperature for a time period 
of from zero time to up to two hours. Following incubation, the Protein A beads were 
washed four times with ice-cold NT2 buffer followed by two washed with NT2 buffer 
supplemented with 1 M urea. Washed beads were resuspended in 100 |il_ NT2 

15 buffer supplemented with 0.1% SDS and 30 jag proteinase K and incubated for 30 
min in a 55°C water bath. Following proteinase K digestion, immunoprecipitated 
RNA was isolated with two phenol/chloroform/isoamyl alcohol extractions and 
ethanol precipitated. 

RNase Protection Assays. After mRNP complexes were immunoprecipitated 
20 from cell lysates and the bound RNA extracted, it was assayed by RNase protection 
using the PharMingen Riboquant assay (San Diego, CA) according to the 
manufacturer's instructions (4501 4K). Briefly, extracted RNA was hybridized with 
excess 32 P-Iabeled riboprobes generated from templates specific for mRNAs 
encoding L32, GAPDH, several murine Myc-related proteins (template set 45356P) 
25 and cyclins (template set 45620P). Non-duplexed RNA was digested by treatment 
with RNase A+T1 . The resulting fragments were resolved by denaturing 
polyacrylamide/urea gel electrophoresis. Because the length of the riboprobe for 
each mRNA species was a unique size, all detectable mRNA species in a sample 
could be resolved in a single gel lane. Protected riboprobe fragments were 
30 visualized on a phosphoimaging screen (Molecular Dynamics) after 24 hours of 
exposure. Phosphoimages were scanned using the Molecular Dynamics STORM 
860 System at 100 micron resolution and analyzed using Molecular Dynamics 
ImageQuant Software (V 1.1). 
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EXAMPLE 2 

RNase Protection in a Multiprobe System: Experimental Results 
Figure 3 shows an immunoprecipitation of HuB and Poly-A binding protein 
(PABP)-mRNP complexes from extracts of murine P19 cells stably transfected with 
g10-HuB cDNA. No mRNAs were detected in pellets immunoprecipitated with 
polyclonal pre-bleed rabbit sera (Figure 3A and 3B t lane 3), or with many other 
rabbit, mouse, and normal human sera tested with this assay (data not shown). The 
profiles of mRNAs associated with HuB mRNP complexes included n-myc, 1-myc, b- 
myc, max and cylins A2, B1, C, D1, and D2, but not sin3, cyclin D3, cyclin B2, L32 or 
GAPDH mRNAs (Figure 3A and 3B, lane 4). In contrast, the profiles of mRNAs 
extracted from PABP-mRNP complexes resembled the profiles of total RNA, but 
showed enriched levels of L32 and GAPDH and decreased levels of sin3 mRNA 
(Figure 3A and 3B, lane 5). It was concluded that antibodies reactive with these 
cellular RNA-binding proteins could be used to immunoprecipitate mRNP complexes 
and to recover mRNAs with which they are specifically associated. These results are 
consistent with the postulate role of Hu proteins in regulating post-transcriptional 
gene expression during cell growth and differentiation. (Jain ef al. (1997) Mol. Cell. 
Biol. 17, 954-962; Levy ef al. (1998) J. Biol. Chem. 273, 6417-6423; Fan and Steitz 

(1998) EMBO J. 17, 3448-3460; Peng et al. (1998) EMBO J. 17, 3461-3470; Antic 
and Keene (1997) Am. J. Hum. Genet. 61, 273-278; Levine etal. (1993) Mol. Cell 
Biol. 13, 3494-3504; Gao ef al. (1 994) Proc. Natl. Acad. Sci. USAI 91,11 207-1 1211; 
King ef al. (1994) J. Neurosci. 14, 1943-1952; Liu etal. (1995) Neurology 45, 544- 
550; Ma etal. (1996) J. Biol. Chem. 271, 8144-8151; Abe etal. (1996) Nucl. Acids 
Res. 24, 201 1-2016; Antic ef al. (1999) Genes Dev. 13, 449-461; Chung ef al. 
(1997) J. Biol. Chem. 272, 6593-6598; Akamatsu ef al. (1999) Proc. Natl. Acad. Sci. 
USA 96, 9884-9890; Sachs ef al. (1997) Cell 89, 831-838; Aranda-Abreu et al. 

(1999) J. Neurosci. 19, 6907-6917). 

EXAMPLE 3 

Identification of mRNA Subsets Associated with RNA Binding Proteins 
En Masse Using cDNA Arrays: Materials and Methods 

To further expand the ability to identify the mRNAs associated in endogenous 
mRNP complexes, this example describes the use of a cDNA array filter as a highly 
specific and sensitive method to detect a mRNA subset without amplification or 
iterative selection (Figure 4). 



-27- 

Antibodies. Monoclonal anti-gene 10 (giO) antibodies were produced as 
previously described (see D. Tsai et a/., Proc. Natl. Acad. Sci. USA, 89, 8864-8868 
(1992); Gao etal. (1994) Proc. Natl. Acad. Sci. USA 91, 11207-11211; Antic etal. 
(1999) Genes Dev. 13, 449-461). Polyclonal sera reactive with Hu proteins were 

5 produced as previously described (Levine et ai (1993) Mol. Cell. Biol. 13, 3494-3504; 
Atasoyefa/. (1998) J. Cell Sci. 111,3145-3156). Antibody against 5'cap binding 
protein (elF-4E) was obtained from Transduction Laboratories (San Diego, CA). 
Antibodies reactive with Poly A binding protein (PABP) were kindly provided by Dr. N. 
Sonenberg of McGill University (Canada). 

10 Cell Culture and Differentiation. Preparation of transgenic cells was as 

described in Example 1. Chemical treatment with retinoic acid (RA) was used to 
induce neuronal differentiation by treating 5 x 10 5 P19 cells, placed a 60 mm petri . 
dish (Fisher Scientific, Pittsburgh, PA, USA, Number 8-757-1 3A), with 0.5 RA 
(Sigma Chemicals, St. Louis, MO, USA, Number R2625), as described in Gao and 

15 Keene (1996)). After two days, 25% of the cells that had formed into clumps were 
removed, placed in new petri dishes, and supplemented with fresh medium and RA. 
Following an additional two days, cell aggregates were washed once with 
phosphate-buffered saline (PBS) and trypsinized. The cells were then plated into 
two 100 mm gelatin-coated tissue culture plates. Cells were harvested after an 

20 additional four days. The RA treated HuB (Hel-N2) stably transfected P19 cells grew 
neurites and displayed characteristics neuronal markers and morphology, but did not 
terminally differentiate and remained susceptible to killing with mitotic inhibitors. 
Cell-free extracts and immunoprecipitations were as described in Example 1 . 

cDNA Array Analysis. cDNA array analysis was performed using Atlas™ 

25 Mouse Arrays (Clontech, Inc., Palo Alto, CA) that contain a total of 597 cDNA 

segments spotted in duplicate, side-by-side on a nylon membrane. Probing of cDNA 
arrays was performed as described in the Clontech Atlas™ cDNA Expression Arrays 
User Manual (PT3140-1). Briefly, RNA was extracted f r om HuB stably transfected 
P19 embryonal carcinoma cells and used to produce reverse transcribed probes. A 

30 pooled set of primers, complementary to the genes represented on the array, was 
used for the reverse transcription probe synthesis, which was radiolabeled with 32 P 
a-dATP. The radiolabeled probe was purified by passage over CHROMA SPIN™- 
200 columns (Clontech, Inc., Palo Alto, CA) and incubated overnight with an array 
membrane using ExpressHyb™ hybridization solution (Clontech, Inc., Palo Alto, CA). 

35 Following hybridization, the array membrane was washed and visualized on a 
phosphorimaging screen (Molecular Dynamics, Sunnyvale, California, USA). 
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Phosphorimages were scanned using the Molecular Dynamics STORM 860 
System at 100 micron resolution and stored as files. Images were analyzed using 
Atlaslmage™ 1.0 and 1.01 software (Clontech, Inc., Palo Alto, CA). The signal for 
any given gene was calculated as the average of the signals from the two duplicate 

5 cDNA spots. As described in the Atlaslmage™ 1.0 software manual (Clontech, Inc., 
Palo Alto, CA), a default external background setting was used in conjunction with a 
background-based signal threshold to determine gene signal significance. The 
signal for a gene was considered significantly above background if its adjusted 
intensity (total signal minus background) was more then two-fold the background 

10 signal. Comparisons of multiple cDNA array images were performed using an 

average of all the gene signals on the array (global normalization) to normalize the 
signal intensity between arrays. Changes in the mRNA profile of HuB mRNP 
complexes in response to retinoic acid treatment were considered significant if they 
were four-fold greater (twice the stringency typically used for establishing 

15 significance of a gene expression change). cDNA array images and overlays were 
prepared using Adobe Photoshop® 5.0.2 (San Jose, California, USA). 



EXAMPLE 4 

20 Identification of mRNA Subsets Associated with RNA Binding Proteins 

En Masse Using cDNA Arrays: Experimental Results 

Results. After assessing the overall gene expression profile of HuB 
transfected P19 cells (the transciptome), HuB and PABP mRNA complexes, as well 

25 as elF-4E mRNP complexes were separately immunoprecipitated and captured 
nriRNAs were identified on cDNA arrays. The initial alignment of these arrays was 
facilitated by spiking the hybridization reaction with radiolabeled lambda phage 
markers that hybridized with six DNA spots on the bottom of the array membrane. 
Once the alignment register was established, subsequent blots did not require the 

30 use of spiked lambda markers for orientation. 

Arrays generated from immunoprecipititaions with rabbit pre-bleed sera were 
essentially blank with the exception of the spiked lambda markers observed at the 
bottom of the array (Figure 5A). Immunoprecipitated HuB mRNP and elF-4E mRNP 
complexes each contained slightly more than 10% of the mRNAs detected in total 

35 cell RNA, but differed considerably from one another (Figure 5B, 5C, and 5E). 

Like HuB and elF-4E, PABP has been implicated in facilitating mRNA 
stabilization and translation (Ross (1995) Microbiol. Rev. 59, 423-450; Ross (1996) 
Trends Genet. 12, 171-175; Wickens etal. (1996) in Translational control, eds 
Hershey, J.W.B, Mathews, M.B. & Sonenberg, N., Cold Spring Harbor Laboratory 
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Press, Cold Spring Harbor, pp. 411-450; Sachs etal. (1997) Ce//89, 831-838). Not 
suprisingly, PABP mRNPs contained many more detectable mRNAs than those 
observed in the HuB or elF-4E mRNPs (Figure 5D). As expected, the profile of the 
mRNAs in the PABP mRNPs from these cells closely resembled that of the 

5 transcriptome. However, as was seen for HuB and elF-4E mRNPs, some mRNAs 
were enriched or depleted in the PABP-mRNPs as compared to the total RNA 
(Figure 5D and 5E). The profiles and relative abundance of mRNAs detected in 
these mRNP complexes were highly reproducible, but the absolute number of mRNA 
species detectable on the phosphorimages occasionally varied as a result of 

10 differences in the specific activity of the probe. 

Because the cDNA arrays derived using total RNA were generated using one- 
tenth the quantity of lysate used for mRNP immunoprecipitations, a comparison of 
the absolute quantities of each mRNA detected in mRNP complexes with those 
observed in the total RNA was not conducted. A more accurate result was obtained 

15 by comparing the relative abundance of each mRNA species to each other within 
each microarray. For example, the relative abundance of the mRNA encoding p- 
actin and ribosomal protein S29 (Figure 5, arrows a and b, respectively) are 
approximately equal to the total cellular RNA, but varied dramatically among each of 
the mRNP complexes. Many other examples of this distinction are readily apparent 

20 in Figure 4. These findings indicated that the mRNA profiles detected in HuB, elF- 
4E, and PABP mRNP complexes are distinct from one another and from those of the 
transcriptome. 

EXAMPLE 5 

25 Alterations in mRNP complexes in response to retinoic acid (RA) 

Since HuB is predominantly a neuronal protein believed to play a role in 
regulating neuronal differentiation, studies were conducted to investigate whether the 
mRNA population fcund in HuB mRNP complexes changes in response to RA, a 

30 chemical inducer of neuronal differentiation. HuB transfected P19 cells were treated 
with RA to induce the onset of neuronal differentiation, HuB mRNP complexes were 
immunoprecipitated, and then associated mRNAs were identified on cDNA arrays. 
Comparison of the mRNA profiles extracted from the HuB mRNPs before and after 
RA treatment revealed that eighteen mRNAs were either exclusively present or 

35 greatly enriched (four-fold or greater) in RA-treated HuB mRNPs (Figure 6A, 6B, 
and 6C, red bars). In addition, three mRNAs (T-lymphocyte activated protein, DNA- 
binding protein SATB1, and HSP84) decreased in abundance by four-fold or greater 
in response to RA treatment (Figure 6C, blue bars). To determine if the changes 
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observed in the mRNA profile of the HuB mRNA complexes were unique, the 
ubiquitously expressed ELAV family member HuA (HuR) was immunoprecipitated 
from these RA treated cells. Although there were a few changes to the HuA mRNP 
profile following treatment with RA, they were minor in comparison with HuB. 

5 The changes in the HuB-associated mRNA profile in response to RA 

treatment did not merely reflect changes in the total cellular mRNA (Figure 6G, 6H, 
and 61). Numerous examples of differentially-enriched or depleted mRNAs detected 
in HuB mRNP complexes are evident by comparing Figure 6C and 61. For 
comparative purposes, this is depicted in Figure 7 by realignment and enlargement 

10 of representative spots. For example, IGF-2 mRNA was detectable only in total RNA 
and HuB mRNP complexes from FRA-treated cells (Figure 7). However, other HuB- 
mRNP-bound mRNAs, such as integrin beta, cyclin D2, and Hsp84 increased or 
decreased in abundance disproportionately to their changes in the total RNA profile 
following RA treatment (Figure 7). The disparity between changes in the mRNA 

15 profiles of total RNA and HuB mRNPs possibly results from changes in 

compartmentalization of mRNAs that flux dynamically through mRNP complexes in 
response to RA treatment. It can be concluded that the mRNA profiles derived from 
these mRNP complexes are dynamic and can reflect the state of growth, as well as 
changes in the cellular environment in response to a biological inducer like retinoic 

20 acid. 



EXAMPLE 6 

In vivo Target Sequence Preferences For RNA-Binding Proteins 

25 Using GenBank and EST databases, the 3' UTR sequences from mRNAs 

enriched in RA-treated HuB-mRNP complexes were identified (TABLE 2). 

TABLE 2 

Gene 3MJTR Consensus Sequence 

CD44 UUUUCUAUUCCUUUUUUAUUU UAUGUCAUUUUUUUA 



UAAAAAACCAAA UUUGAUU GGCUCUAAACA 
IGF-2 UAAAGAAAUUAAUU GGCUAAAAACAUA 

CUAAAAAUUAAUU GGCUUAAAAA 

UCACUCUU UAUUAUU AU 
HOX 2.5 AAAU UUUAUUA AGUUA 

AUCAGGUUCAUUU UGGUUGU 

Inhibitor AU UUUAUCU GUUA 

J6 UUUUGUUUUUCUCCCUUUUUUAGUUU UUUCAAA 



GADD45 



UAUUUUUUUUCUUUUUUUU UUUUGGU CUUUAU 
UUAAAUUCUCAGAAGUUUUAUUA UAAAUCUU 



UUCUGUUAAAUAUU UUUAUAU ACUGCUUUCUUUUUU 
Nexin 1 AUUUUAUAGUAGUU UUUAUGU UUUUAUGGAAAA 

AUUUGCCUUUUUAAUU CUUUUU 



-31- 



Egr-1 UAUUUUGUGGUUUUAUUU UACUUUGUACUU 

Zif 268 UUUUGUUUUCCUU 

UUU UUUAUUU UCUGUAUUUUUU 
Neuronal- UUUUUUUUAAAUUUU UUUAUUU UCUUUUU 

Cadherin UUUUUUAUUUUC UGUAUUU UUU 

UUUUUAAUUUUUUAAUU UUUUUU 

Integrin AAUGG UUUAUAU UUAUGAU 

alpha 5 UUGUUUAUAU CUUCAAU 

SEF2 UUCAAGCGCUUGANUU 



Cf2r UGCAUCGAUCCGUUGAUUUACUACU 

Integrin UAUAAUUU UUAAUUU UUUAUUAUUUU 

beta UAUUUUACCUUUUUUUUUUUUCUUUAAUU CCUGGU 



CTCF 



UUAUGAAUGU U AU AUUU GU 

UCUUAAUUU UUUCUCUUUUUUUUCU 



UUUUUUUUUCCU UUUAAUU GUAAAUGGUUCUUU 
UUAAUGAUCAUUCAGAUUGUA UAUAUUU GUUUCCUUU 
UUCAAUUUUU UUUAUAU ACUAUCUU 

TGF beta 2 UUUUUC- UUUAAUU GGUUUUUUUMTP 

UGUCUUGUTCUGAGCA UUUAUUU UCAAA 
UUCUCGUCUUG UUUAUUU UACAA 

UAUAAUAAUAGUUUAUGU UUUGGAUGUUUGGU 

Cyclin D2 AUGUCUUGUUCUUUGUGUUU UUAGGAU 



(AU/GA) UUUAUUU (UA/AG) 
In Vitro Consensus Sequence 



Many of the mRNAs for which 3' UTR sequences were available contained 
similar uridylate-rich motifs as those previously found to bind to Hu protein in vitro 
(Levine ef al. (1993) Mol. Cell Biol. 13, 3494-3504; Gao et al. (1994) Proc. Natl. 

5 Acad. Sci. USA 91,11 207-1 1211; King et al (1 994) J. Neurosci. 1 4, 1 943-1 952; Liu 
era/. (1995) Neurology 45, 544-550; Ma ef al. (1997) Nucleic Acids Res. 25, 3564- 
3569; Fan era/. (1997) Genes Dev. 11, 2557-2568). Moreover, most of these 
mRNAs encode proteins that are expressed in neuronal tissues or are known to be 
up-regulated following RA-induced neuronal differentiation (Beck et al. (1995) Neuron 

10 14, 717-730; Colon and Rossant (1992) Development 116, 357-368; Graham et al. 
(1991) Development 112, 255-264; Hirsch ef al. (1994) Dev. Dyn. 201, 108-120; Hunt 
ef al. (1991 ) Developments, 43-50; Janssen-Timmen et al. (1989) Gene 80, 325- 
336; Kondo ef al. (1992) Nucleic Acids Res. 20, 5729-5735; Konishi ef al. (1994) 
Brain Res. 649, 53-61; Neuman ef al. (1993) Eur. J. Neurosci. 5, 311-318; Okuda et 

15 al. (1995) Genomics 29, 623-630; Soosaar et al. (1994) Bran Res. Mol. Brain Res. 
25, 176-180; Takechi ef al. (1992) Eur. J. Biochem. 206, 323-329; Telford ef al. 
(1990) Mol. Reprod. Dev. 27, 81-92; Zwartkrius etal. (1993) Exp. Cell Res. 205, 422- 
425; Tomaselli ef al. (1988) Neuron 1, 33-43; Redies (1995) Exp. Cell Res. 220, 243- 
256; Ross et al. (1996) J. Neurosci. 16, 210-219). The sequence alignment shown in 

20 TABLE 3 is consistent with the previous results of Levine et al. ((1993) Mol. Cell Biol. 
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13, 2494-3504) and Gao etal. ((1994) Proc. Natl. Acad. Sci. USA 91, 11207-11211) 
who used in vitro selection to derive a consensus RNA-binding sequence for HuB. 
Using the methods described herein, it is possible to discern in vivo target sequence 
preferences for other RNA-binding proteins. 

5 

EXAMPLE 7 

Use Of mRNA Binding Proteins To Purify Endogenous mRNP Complexes And 
To Identify Associated mRNAs En Masse Using cDNA Array Analysis 

10 Earlier attempts to identify mRNA targets of the HuB protein using high- 

throughput methods required RT-PCR amplification and in vitro iterative selection 
and identified several structurally related ERG mRNAs from neuronal tissues (Gao et 
al. (1994) Proc. Natl. Acad. Sci. USA 91, 11207-11211; Andrews and Keene (1999) 
Methods Mol. Biol. 118, 233-244). Most of these mRNAs contained ARE-like 

15 sequences in their 3'-UTRs, which is a characteristic of ERG mRNAs (Keene (1999) 
Proc. Natl. Acad. Sci. USA 96, 5-7; Levine et al. (1993) Mol. Cell Biol. 13, 3494- 
3504; Gao etal. (1994) Proc. Natl. Acad. Sci. USA 91, 11207-11211; King etal. 
(1994) J. Neurosci. 14, 1943-1952). It has been demonstrated that Hu proteins can 
bind ERG mRNAs and affect their stability and/or translational activation ( Jain ef al. 

20 (1997) Mol. Cell Biol. 17, 954-962; Levy et al. (1998) J. Biol. Chem. 273, 6417-6423; 
Fan and Steitz (1998) EMBO J. 17, 3448-3460; Peng et al. (1998) EMBO J. 17, 
3461-3470; Keene (1999) Proc. Natl. Acad. Sci. USA 96, 5-7; Levine et al. (1993) 
Mol. Cell Biol. 13, 3494-3504; Gao etal. (1994) Proc. Natl. Acad. Sci. USA 91, 
11207-11211; King ef al. (1994) J. Neurosci. 14, 1943-1952; Liu ef al. (1995) 

25 Neurology 45, 544-550; Chung et al. (1997) J. Biol. Chem. 272, 6593-6598; Antic et 
al. (1999) Genes Dev. 13, 449-461; Ma etal. (1997) Nucleic Acids Res. 25, 3564- 
3569; Aranda-Abreu ef al. (1999J J. Neurosci. 19, 6907-6917). The in vitrc approach 
of Gao ef a/. ((1 994) Proa Natl. Acad. Sci. USA 91,11 207-1 1211) yielded a distinct 
mRNA subset from human brain and medulloblastoma cells with ERG sequence 

30 characteristics. This more direct in vivo approach obviates the need for in vitro 
binding and PCR amplification. Moreover, this new approach allows the 
identification of mRNA transcripts with linked structural and functional properties, 
may of which would not be detected (and could not be detected) using in vitro 
techniques. In addition, recognizable HuB protein-RNA binding sequences were 

35 identified within the in wVo-captured mRNA subset (TABLE 2). 

The foregoing examples are illustrative of the present invention, and are not to 
be construed as limiting thereof. The invention is described by the following claims, 
with equivalents of the claims to be included therein. 



